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1 MemoriES^ H 
server design 

Ashwini Nanda, Kwok-Ken Mak, Krishnan Sugarvanam, Ramendra K. Sahoo, Vijayaraghavan 
Soundarararjan, T. Basil Smith 

November 2000 Proceedings of the ninth international conference on Architectural 

support for programming languages and operating systems, volume 34 , 

28 Issue 5,5 

|| i i ' -mfc .r/-7^ rf i-o ls^^^ Additional Information: full citation, abstract, references, citings, index 

Full text available: fltpdfi /24.53 KB i ' ' ' 

^ * terms 

Modern system design often requires multiple levels of simulation for design validation and 
performance debugging. However, while machines have gotten faster, and simulators have 
become more detailed, simulation speeds have not tracked machine speeds, As a result, it 
is difficult to simulate realistic problem sizes and hardware configurations for a target 
machine. Instead, researchers have focussed on developing sealing methodologies and 
running smaller problem sizes and configurations that atte ... 

2 MernorlES: a programmable, reai-time hardware emulation tool for muitiprocessor B 
server design 

Ashwini Nanda, Kwok-Ken Mak, Krishnan Sugavanam, Ramendra K. Sahoo, Vijayaraghavan 
Soundararajan, T. Basil Smith 

November 2000 ACM SIGPLAN Notices, Volume 35 issue n 

Full text available: B ^odf{1.64 MB) Additional Information: full citation , abstract, references , index terms 

Modern system design often requires multiple levels of simulation for design validation and 
performance debugging. However, while machines have gotten faster, and simulators have 
become more detailed, simulation speeds have not tracked machine speeds. As a result/it 
is difficult to simulate realistic problem sizes and hardware configurations for a target 
machine. Instead, researchers have focussed on developing scaling methodologies and 
running smaller problem sizes and configurations that atte ... 

3 FPGA-b^ H 
niicroprgcessors 

Nobuyuki Ohba, Kohji Takano 

June 2004 Proceedings of the 41st annual conference on Design automation - Volume 
OO 

Full text available: 'g j odf(75.56 KB) Additional Information: full citation, abstract, references, index terms 
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In System on Chip (SoC) design, growing design complexity has forced designers to start 
designs at higher abstraction levels. This paper proposes an SoC design methodology that 
makes full use of FPGA capabilities. Design modules in different abstraction levels are all 
combined and run together in an FPGA prototyping system that fully emulates the target 
SoC. The higher abstraction level design modules run on microprocessors embedded in the 
FPGAs, while lower-level synthesizable RTL design module ... 

Keywords: ASIC, FPGA prototyping, SoC, mixed-level verification 



4 Ihe Transit Q 
David M. Lewis, David R. Galloway, Marcus van Ierssel, Jonathan Rose, PauLChow 
February 1997 Proceedings of the 1997 ACM fifth international symposium on Field- 
programmable gate arrays 

Full text available: f^ pdff1.31 MB) Additional Information: full citation, references, citings, index terms 



5 Rapid prototyping of ASiC based systems B 
P. H. Kelly, K. J. Page, P. M. Chau 

June 1994 Proceedings of the 31st annual conference on Design automation 

Full text available: ^.pdfi139 s 19. KB} Additional Information: fujl.citation, references, index.terms 



6 Expioiti^ 

Karlheinz WeiB, Thorsten Steckstor, Gemot Koch, Wolfgang Rosenstiel 

February 1999 Proceedings of the 1999 ACM/SIGDA seventh international symposium 
on Field programmable gate arrays 

Full text available: pdf(2.02 MB) Additional Information: full citation, references, citings, index terms 



7 Board-level multiterminal net routing for FPGA-based logic emulation 
Wai-Kei Mak, D. F. Wong 

April 1997 ACM Transactions on Design Automation of Electronic Systems (TODAES), 

Volume 2 Issue 2 

Additional Information: full citation, abstract, references, citings, index 
terms, review 

We consider a board-level routing problem applicable to FPGA-based logic emulation 
systems such as the Realizer System [Varghese et al. 1993] and the Enterprise Emulation 
System [Maliniak 1992] manufactured by Quickturn Design Systems. Optimal algorithms 
have been proposed for the case where all nets are two-terminal nets [Chan and Schlag 
1993; Mak and Wong 1995]. We show how multiterminal nets can be handled by 
decomposition into two-terminal nets. We show that the multiterminal net decomp ... 

Keywords: board-level routing, crossbars, field programmable gate arrays, logic 
emulation, multi-terminal net decomposition 



8 The desjgnMBP 

Koray Oner, Luiz A. Barroso, Sasan Iman, Jaeheon Jeong, Krishnan Ramamurthy, Michel 
Dubois 

February 1995 Proceedings of the 1995 ACM third international symposium on Field- 
programmable gate arrays 
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Full text available: ^pdf(54.Q1 KB) Additional Information: Ml-Citatjon, abstract references, citings, index 

terms 

Recent advances in Field-Programmable Gate Arrays (FPGA) and programmable 
interconnects have made it possible to build efficient hardware emulation engines. In 
addition, improvements in Computer-Aided Design (CAD) tools, mainly in synthesis tools, 
greatly simplify the design of large circuits. The RPM (Rapid Prototype Engine for 
Multiprocessors) Project leverages these two technological advances. Its goal is to develop 
a common hardware platform for th ... 

Keywords: field-programmable gate arrays, logic emulation, message-passing 
multicomputers, rapid prototyping, shared-memory multiprocessors 



9 Wormhole„IP.oye 

Manolis G. H. Katevenis, Iakovos Mavroidis, Georgios Sapountzis, Eva Kalyvianaki, Ioannis 
Mavroidis, Georgios Glykopoulos 

October 2001 IEEE/ACM Transactions on Networking (TON), Volume 9 issue 5 

Full text available: ^ pdf(211.25 KB) Additional Information: full citation, abstract, references, citings, index 

High-speed switches and routers internally operate using fixed-size cells or segments; 
variable-size packets are segmented and later reassembled. Connectionless ATM was 
proposed to quickly carry IP packets segmented into cells (AAL5) using a number of 
hardware-managed ATM VCs. We show that this is analogous to wormhole routing. We 
modify this architecture to make it applicable to existing ATM equipment: we propose a 
low-cost, single-input, single-output Wormhole IP Router that functions as a VP/ ... 

Keywords: Connectionless ATM, IP over ATM, gigabit router, routing filter, wormhole IP, 
wormhole routing 



10 QfioptjmaLM B 
Wai-Kei Mak, 6. F. Wong 

January 1995 Proceedings of the 32nd ACM/IEEE conference on Design automation 

Full text available: ^ fxif( 200.79 KB) Additional Information: full citation, references, citings, index terms 



11 Board-ievel multi-terminal net routing for FFGA-based logic emuiation H 
Wai-Kei Mak, D. F. Wong 

December 1995 Proceedings of the 1995 IEEE/ACM international conference on 
Computer-aided design 

Full text available: ^pd£22126.KBj Additional Information: full citation, abstract, references, citings, index 

^Pyb!isher.Sjte ^rms 

We consider a board-level routing problem applicable to FPGA-based logic emulation 
systems such as the Realizer System [3] and the Enterprise Emulation System [5] 
manufactured by Quickturn Systems. Optimal algorithms have been proposed for the case 
where all nets are two-terminal nets [10,11]. In this paper, we show how multi-terminal 
nets can be handled by decomposition into two-terminal nets. We show that the multi- 
terminal net decomposition problem can be modelled as a bounded-degree hypergra ... 

Keywords: board-level routing, bounded-degree hypergraph-to-graph transformation, 
FPGA, logic emulation systems, multi-terminal net decomposition net 
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12 Prototyping, verification, and test: Implementation of BEE: a real-time large-scale 
hardware 

Chen Chang, Kimmo Kuusilinna, Brian Richards, Robert W. Brodersen 

February 2003 Proceedings of the 2003 ACM/SIGDA eleventh international symposium 
on Field programmable gate arrays 

Full text available: ' gj pdf(3.65 MB) Additional Information: full citation , abstract, references, index terms 

This paper describes the hardware implementation of a real-time, large-scale, multi-chip 
FPGA (Field Programmable Gate Array) based emulation engine with a capacity of 10 million 
ASIC (Application Specific Integrated Circuits) equivalent gates. Attainable system 
operation frequency can exceed 60 MHz, and the system throughput has been empirically 
verified to achieve 600 billion 16-bit additions per second. The emulator is custom designed 
to maximize the performance and resource utilization for a ... 

Keywords: FPGA, hardware emulation, rapid-prototyping 



13 Module generation of complex macros for logic-emulation applications 
Wen-Jong Fang, Allen C.-H. Wu, Duan-Ping Chen 

February 1997 Proceedings of the 1997 ACM fifth international symposium on Field- 
programmable gate arrays 

Full text available: S.pdff 1..48.MB} Additional Information: fuJJ.citatjon, references, citings, index .terms 



14 A_hybr|^ H 
Mohammed A. S. Khalid, Jonathan Rose 

March 1998 Proceedings of the 1998 ACM/SIGDA sixth international symposium on 
Field programmable gate arrays 

Additional Information: full citation, abstract , references, citings, index 



Full text available: W odff 1.12MB} 

lerms 

Multi-FPGA systems (MFSs) are used as custom computing machines, logic emulators and 
rapid prototyping vehicles. A key aspect of these systems is their programmable routing 
architecture; the manner in which wires, FPGAs and Field-Programmable Interconnect 
Devices (FPIDs) are connected. Several routing architectures for MFSs have been proposed 
[Arno92] [Butt92] [Hauc94] [Apti96] [Vuil96] and previous research has shown that the 
partial crossbar is one of the best existing architectures [Kim9 ... 

15 Reconfigurable systems: SmartGlue: an interface controller with auto reconfiguration 

for field programmable computing machine 

Young-II Kim, Bong-Il Park, Jae-Gon Lee, Chong-Min Kyung 

January 2004 

Full text available: flpdf(1 1 3. 02. KB) 

(Sf Additional Information: full citation, abstract , references 

^.Publisher Site 

This paper describes an interface controller called SmartGlue which enables a processor to 
interface with peripherals and makes a system reconfigurable by programming FPGA on the 
fly. By supporting standard interfaces and plug and play mechanism for the processor and 
FPGA, one can use any type of processors and FPGA's to implement a field programmable 
computing machine. The performance and utility of the SmartGlue was validated by 
applying its silicon implementation into a real system. 

16 Software for Reconfigurable Systems: Incremental reconfiguration of muiti-FPGA 
systems 

K. K. Lee, D. F. Wong 
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February 2002 Proceedings of the 2002 ACM/SIGDA tenth international symposium on 
Field-programmable gate arrays 

Full text available: ^.pdgi46 i 68„KBl Additional Information: Mljcitatjon, abstract, references 

In reconfigurable computing, circuits implemented on multi-FPGA systems have to be 
incrementally modified. Since reconfiguring an FPGA is time-consuming, the time for 
reconfiguration depends on the number of FPGAs to be reconfigured. Our objective is to 
reduce the number of such FPGAs. In this paper, we consider the specific problem of 
incrementally reconfiguring a multi-FPGA system that utilizes the direct interconnection 
architecture, where routing connections between FPGAs are to neighbors th ... 

17 Poster session 2: A comparison between mask- and field-programmable routing Q 

structures on industrial FPGA architectures 

Luca Macchiarulo, Consolato F. Caccamo, Davide Pandini 

April 2004 Proceedings of the 14th ACM Great Lakes symposium on VLSI 

Full text available: ^p„df(1J6A3.KBj Additional Information: MLsjtation, abstract, references, indexjerms 

In this paper we compare the routing architecture of island-style FPGAs based on field- 
programmable switch boxes with a mask-programmable routing structure, in order to 
assess its position in the design space of routing opportunities available to VLSI IC 
designers. Although the results presented in this work depend on a few implementation 
details that will be discussed in the paper, the mask-programmable routing structure shows 
a large area saving and delay improvement with respect to the field- ... 

Keywords: FPGA, interconnect architectures, mask-programmable, routing 



18 Placement:. Ha^ 
placement 

Michael G. Wrighton, Andre M. DeHon 

February 2003 Proceedings of the 2003 ACM/SIGDA eleventh international symposium 
on Field programmable gate arrays 

Additional Information: fgjl.citatjon, abstract, references, citinas, index 



Full text available: fH pdft503.33 KBj 

^ terms 

To truly exploit FPGAs for rapid turn-around development and prototyping, placement times 
must be reduced to seconds; late-bound, reconfigurable computing applications may 
demand placement times as short as microseconds. In this paper, we show how a systolic 
structure can accelerate placement by assigning one processing element to each possible 
location for an FPGA LUT from a design netlist. We demonstrate that our technique 
approaches the same quality point as traditional simulated annealing as ... 

Keywords: design automation, field-programmable gate arrays, placement, reconfigurable 
computing, simulated annealing 



19 Multi-terminal net routing for partial crossbar-based muiti-FPGA systems Q 
Abdel Ejnioui, N. Ranganathan 

February 1999 Proceedings of the 1999 ACM/SIGDA seventh international symposium 
on Field programmable gate arrays 

Full text available: SpdfQ.fiS.MBi Additional Information: MLciMton, Merences, citings, index ternis 



Keywords: FPGA architecture, FPGA routing, branch-and-price, integer programming, 
interconnect optimization, layout synthesis 
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20 Multiway FPGA partitioning by fully exploiting design hierarchy 
Wen-Jong Fang, Allen C.-H. Wu 

January 2000 ACM Transactions on Design Automation of Electronic Systems 

(TODAES), Volume 5 Issue 1 

Additional Information: fujj.cjtaiipn^ abstract, references, citjnss, Index 



Full text available .. . . . ... , 

LiJ terms 

In this paper, we present a new integrated synthesis and partitioning method for multiple- 
FPGA applications. Our approach bridges the gap between HDL synthesis and physical 
partitioning by fully exploiting the design hierarchy. We propose a novel multiple-FPGA 
synthesis and partitioning method which is performed in three phases: (1) fine-grained 
synthesis, (2) functional-based clustering, and (3) hierarchical set-covering partitioning. 
This method first synthesizes a design specification in ... 

Keywords: fine-grained synthesis, functional clustering, multi-way partitioning, multiple- 
FPGA synthesis 
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1 Performs 

Guang-Ming Wu, Jai-Ming Lin, Yao-Wen Chang 

October 2002 ACM Transactions on Design Automation of Electronic Systems 

(TODAES), Volume 7 Issue 4 
Full text available: °f| | pdf(221.44 KB) Additional Information: full citation, abstract, references, index terms 



In this article, we introduce a new placement problem motivated by the Dynamically 
Reconfigurable FPGA (DRFPGA) architectures. Unlike traditional placement, the problem for 
DRFPGAs must consider the precedence constraints among logic components. For the 
placement, we develop an effective metric that can consider wirelength, register 
requirement, and power consumption simultaneously. With the considerations of the new 
metric and the precedence constraints, we then present a three-stage scheme of ... 

Keywords: Computer-aided design of VLSI, dynamically reconfigurable, field- 
programmable gate array, layout, placement 
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Depth optimal incremental mapping for field programmable gate arrays | 
Jason Cong, Hui Huang 

June 2000 Proceedings of the 37th conference on Design automation 

i- n* * i ui 0t wMcfi rtc i/tv Additional Information: full citation, abstract, references^ citincs, index 

Full text available: "P hxjf(166.9fr KB» J ' J W " J 

terms 

In this paper, we study the incremental technology mapping problem for lookup-table (LUT) 
based Field Programmable Gate Arrays (FPGAs) under incremental changes. Given a gate- 
level networks, a mapping solution associated with it, and a sequence of changes to the 
original network, we compute a new mapping solution by modifying the existing one. 
Moreover, we assume that the given mapping solution is depth-optimal and we are required 
to come up with a modified mapping solution that maintains t ... 

Tools: Incremental physical resynthesis for timing optimization | 
Peter Suaris, Lungtien Liu, Yuzheng Ding, Nanchi Chou 

February 2004 Proceedings of the 2004 ACM/SIGDA 12th international symposium on 
Field programmable gate arrays 

Full text available: ^pdf{762JAKBj Additional Information: fulJ„cjtatjon, abstract references, index.terms 

This paper presents a new approach to timing optimization for FPGA designs, namely 
incremental physical resynthesis, to answer the challenge of effectively integrating logic and 
physical optimizations without incurring unmanageable runtime complexity. Unlike previous 
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approaches to this problem which limit the types of operations and/or architectural 
features, we take advantage of many architectural characteristics of modern FPGA devices, 
and utilize many types of optimizations including < ... 

Keywords: FPGA, logic synthesis, placement, timing optimization 



4 QptimaiEPGA.^ 

S. Fekete, E. Kohler, 3. feich 

March 2001 Proceedings of the conference on Design, automation and test in Europe 

Full text available: pdf(1S5.17 KB) Additional Information; full citation, references, citings, index terms 



Generating highiy-routabie sparse crossbars for PLDs Q 
Guy Lemieux, Paul Leventis, David Lewis 

February 2000 Proceedings of the 2000 ACM/SIGDA eighth international symposium on 
Field programmable gate arrays 



Additional Information: fuJl.cltatipn, abstract, references, citfoss, index 



Full text available: m pdf(952.31 KB) 

LLJ " terms 

A method for evaluating and constructing sparse crossbars which are both area efficient and 
highly routable is presented. The evaluation method uses a network flow algorithm to 
accurately compute the percentage of random test vectors that can be routed. The 
construction method attempts to maximize the spread of the switch locations, such that any 
given subset of input wires can connect to as many output wires as possible. Based on 
Hall's Theorem, we argue that this increases the likelihood ... 



InnoyMiye AppjjcMQ Q 
Sriram Swaminatfian, Russell Tessier, Dennis Goeckel, Wayne Burieson 
February 2002 Proceedings of the 2002 ACM/SIGDA tenth international symposium on 
Field-programmable gate arrays 

Full text available: ^pd£235,26.KBj Additional Information: MLQiMion, abstract, references., citings 

The use of error-correcting codes has proven to be an effective way to overcome data 
corruption in digital communication channels. Although widely-used, the most popular 
communications decoding algorithm, the Viterbi algorithm, requires an exponential increase 
in hardware complexity to achieve greater decode accuracy. In this paper, we describe the 
analysis and implementation of a reduced-complexity decode approach, the adaptive Viterbi 
algorithm (AVA). Our AVA design is implemented in reconfigu ... 



Keywords: FPGA, Viterbi coding, dynamic reconfiguration 



7 Compilation techniques for embedded applications: Area-efficient instruction set Q 
synthesjsjg 

Philip Brisk, Adam Kaplan, Majid Sarrafeadeh 

June 2004 Proceedings of the 41st annual conference on Design automation - Volume 
OO 

Full text available: ^odf(221.62 KB) Additional Information: full citation, abstract, references, index terms 

Silicon compilers are often used in conjunction with Field Programmable Gate Arrays 
(FPGAs) to deliver flexibility, fast prototyping, and accelerated time-to-market. Many of 
these compilers produce hardware that is larger than necessary, as they do not allow 
instructions to share hardware resources. This study presents an efficient heuristic which 
transforms a set of custom instructions into a single hardware datapath on which they can 
execute. Our approach is based on the classic problems of fi ... 
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Keywords: compiler, field-programmable gate array (FPGA), integer linear programming 
(ILP), resource sharing 



8 OptimaJjty.and„ Q 
Jason Cong, Michail Romesis, Min Xie 

November 2003 Proceedings of the 2003 IEEE/ACM international conference on 
Computer-aided design 

Full text available: ^pdf(3l6.85 KB) Additional Information: full citation, abstract, index terms 

This work studies the optimality and stability of timing-drivenplacement algorithms. The 
contributions of this work include twoparts: 1) We develop an algorithm for generating 
synthetic examples with known optimal delay for timing driven placement(T-PEKO). The 
examples generated by our algorithm can closelymatch the characteristics of real circuits. 
2) Using these syntheticexamples with known optimal solutions, we studied the optimalityof 
several timing-driven placement algorithms for FPGAs by c ... 

9 PoMer„sessjon.2;..A„^ Q 
structuj^^ 

Luca Macchiarulo, Consolato F. Caccamo, Davide Pandini 

April 2004 Proceedings of the 14th ACM Great Lakes symposium on VLSI 

Full text available: ^ pdf(13S,43 KB) Additional Information: full citation, abstract, references, index terms 

In this paper we compare the routing architecture of island-style FPGAs based on field- 
programmable switch boxes with a mask-programmable routing structure, in order to 
assess its position in the design space of routing opportunities available to VLSI IC 
designers. Although the results presented in this work depend on a few implementation 
details that will be discussed in the paper, the mask-programmable routing structure shows 
a large area saving and delay improvement with respect to the field- ... 

Keywords: FPGA, interconnect architectures, mask-programmable, routing 



10 New FPGA architectures: Using bus-based connections to improve field-programmable [Z] 
gate array density for implementing datapath circuits 
Andy Ye, Jonathan Rose 

February 2005 Proceedings of the 2005 ACM/SIGDA 13th international symposium on 
Field-programmable gate arrays 

Full text available: ^fidf(239.^„KB) Additional Information: MLQit3t[on, abstract, references, index terns 

As the logic capacity of Field-Programmable Gate Arrays (FPGAs) increases, they are being 
increasingly used to implement large arithmetic-intensive applications, which often contain 
a large proportion of datapath circuits. Since datapath circuits usually consist of regularly 
structured components (called bit-slices) which are connected together by regularly 
structured signals (called buses), it is possible to utilize datapath regularity in order to 
achieve significant area savings through FPGA a ... 

Keywords: FPGA architecture, area efficiency, datapath regularity, reconfigurable fabric, 
routing architecture 



11 Combinational logic synthesis for LUT based field programmable gate arrays Q 
Jason Cong, Yuzheng Ding 

April 1996 ACM Transactions on Design Automation of Electronic Systems (TODAES), 

Volume 1 Issue 2 
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Full text available: ^pjif£628..M.KB) Additional Information: full citation, abstract, references, citings, index 

ierras. review 

The increasing popularity of the field programmable gate-array (FPGA) technology has 
generated a great deal of interest in the algorithmic study and tool development for FPGA- 
specific design automation problems. The most widely used FPGAs are LUT based FPGAs, in 
which the basic logic element is a K-input one-output lookup-table (LUT) that can 
implement any Boolean function of up to K variables. This unique feature of the LUT has 
brought new challenges to lo ... 

Keywords: FPGA, area minimization, computer-aided design of VLSI, decomposition, delay 
minimization, delay modeling, logic optimization, power minimization, programmable logic, 
routing, simplification, synthesis, system design, technology mapping 



12 §§curjty..Qn Q 

Thomas Wollinger, Jorge Guajardo, Christof Paar 

August 2004 ACM Transactions on Embedded Computing Systems (TECS), volume 3 issue 3 
Full text available: *| | pdf< 296.79 KB) Additional Information: full citation , abstract, references, index terms 

In the last decade, it has become apparent that embedded systems are integral parts of our 
every day lives. The wireless nature of many embedded applications as well as their 
omnipresence has made the need for security and privacy preserving mechanisms 
particularly important. Thus, as field programmable gate arrays (FPGAs) become integral 
parts of embedded systems, it is imperative to consider their security as a whole. This 
contribution provides a state-of-the-art description of security issues ... 

Keywords: Cryptography, FPGA, attacks, cryptographic applications, reconfigurable 
hardware, reverse engineering, security 



13 Architecture analysis and automation: An FPGA architecture with enhanced datapath I I 
functionality 

Katarzyna Leijten-Nowak, Jef L. van Meerbergen 

February 2003 Proceedings of the 2003 ACM/SIGDA eleventh international symposium 
on Field programmable gate arrays 

Full text available: ^ pdfM 88,66 KB) Additional Information: full citation, abstract, references, index terms 

Although FPGAs are a cost-efficient alternative for both ASICs and general purpose 
processors, they still result in designs which are more than an order of magnitude more 
costly and slower than their equivalents implemented in dedicated logic. This efficiency gap 
makes FPGAs less suitable for high-volume cost-sensitive applications (e.g. embedded 
systems). We show that the intrinsic cost of traditional general-purpose FPGAs can be 
reduced if they are designed to target an application domain or a ... 

Keywords: DSP, FPGAs, adder inverting property, application-domain tuning, logic block 
architectures, symmetry 



14 Analysis of FPGA/FPIC switch modules □ 
Yao-Wen Chang, Kai Zhu, Guang-Ming Wu, D. F. Wong, C. K. Wong 
January 2003 ACM Transactions on Design Automation of Electronic Systems 

(TODAES), Volume 8 Issue 1 

Full text available* I Bpdff 508.09 KB) Additional Information: M! cIMlQn, abstract, mferences, index terms, 
^ review 

Switch modules are the most important component of the routing resources in 
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FPGAs/FPICs. Previous works have shown that switch modules with higher routability result 
in better area performance for practical applications. We consider in this paper an 
FPGA/FPIC switch-module analysis problem: the inputs consist of a switch-module 
description and the number of nets required to be routed through the switch module; the 
question is to determine if there exists a feasible routing for the routing requir ... 

Keywords: Computer-aided design of VLSI, FPGA, FPIC, layout, synthesis 



15 Fast placement approaches for FPGAs Q 
Russell Tessier 

April 2002 ACM Transactions on Design Automation of Electronic Systems (TODAES), 

Volume 7 Issue 2 

Full text available: ^pdf(606.13 KB) Additional Information: full citation, abstract, references, index terms 

Recent trends in FPGA development indicate a strong shift toward design reuse through the 
use of intellectual property (IP). This design shift has motivated the development of 
Frontier, a timing-driven FPGA placement system that uses design macroblocks in 
conjunction with a series of placement algorithms to achieve highly routable and high- 
performance layouts quickly. In the first stage of design placement, a macro-based 
floorplanner is used to quickly identify an initial layout based on interma ... 

Keywords: Computer-aided design of VLSI, field-programmable gate arrays, layout, 
synthesis 
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terms 

Tools and a design methodology have been developed to support partial run-time 
reconfiguration of FPGA logic on the Field Programmable Port Extender. High-speed Internet 
packet processing circuits on this platform are implemented as Dynamic Hardware Plugin 
(DHP) modules that fit within a specific region of an FPGA device. The PARBIT tool has been 
developed to transform and restructure bitfiles created by standard computer aided design 
tools into partial bitsteams that program DHPs. The methodolo ... 

Keywords: FPG, IP, Internet, hardware, modularity, network, packet, partial RTR, platform 
computing, reconfiguration, routing 
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Full text available: pdf(2Q5.40 KB) Additional Information: full citation, abstract, references, index terms 

Moore's Law states that the number of transistors on a device doubles every two years; 
however, it is often (mis)quoted based on its impact on CPU performance. This important 
corollary of Moore's Law states that improved clock frequency plus improved architecture 
yields a doubling of CPU performance every 18 months. This paper examines the impact of 
Moored Law on the peak floating-point performance of FPGAs. Performance trends for 
individual operations are analyzed as well as the performance tr ... 
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tables 

Amit Chowdhary, John P. Hayes 

January 2002 ACM Transactions on Design Automation of Electronic Systems 

(TODAES), Volume 7 Issue 1 
Full text available: Qodfn.Q7 MB) Additional Information: full citation, abstract, references, index terms 

We present a general technology-mapping methodology (TULIP) for field-programmable 
gate arrays (FPGAs) that can yield optimal results, and is applicable to any FPGA with a 
logic block composed of lookup tables (LUTs). We introduce the concept of a virtual switch 
to model the internal connections of a logic block with multiple LUTs; each configuration of 
virtual switches is called a multiple-LUT block (MLB). A logic block can be precisely defined 
by a small but complete set of representative conf ... 

Keywords: Basis, circuit partitioning, field-programmable gate arrays, lookup tables 
(LUTs), mapping, multiple-LUT blocks, nonrooted trees, rooted trees, synthesis 
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Full text available: l Ppdf(179.95 KB? 

v^"^ terms 

We present a pipelining-aware router for FPGAs. The problem of routing pipelined signals is 
different from the conventional FPGA routing problem. For example, the two terminal N- 
Delay pipelined routing problem is to find the lowest cost route between a source and sink 
that goes through at least N (N > 1) distinct pipelining resources. In the case of a multi- 
terminal pipelined signal, the problem is to find a Minimum Spanning Tree that contains * 
sufficient pipelining resources such that the del ... 

Keywords: BFS, PipeRoute, minimum spanning tree, pipelined circuits, pipelining, routing, 
retimed circuits, retiming 
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